SEQUENCE LISTING 



<110> Nilles, Matthew L . 
Matson, Jyl S. 

<120> YERSINIA SPECIES COMPOSITIONS 

<130> 3128-6046US 

<150> US 60/444,076 

<151> 2003-01-31 

<160> 20 

<170> Patentln version 3.2 

<210> 1 

<211> 87 

<212> PRT 

<213> Yersinia pestis 
<220> 

<223> amino acid sequence of YscF 
<220> 

<223> sequence can be found at MedLine accession number NP 857921.1 

<400> 1 

Met Ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp He Ala Asp Leu 
1 5 10 15 



Asp Ala Val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 



Ala Val Asn Asp Ser He Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro 
35 40 45 



Ala Leu Leu Ala Asp Leu Gin His Ser He Asn Lys Trp Ser Val He 
50 55 60 



Tyr Asn He Asn Ser Thr He Val Arg Ser Met Lys Asp Leu Met Gin 
65 70 75 80 



Gly He Leu Gin Lys Phe Pro 
85 



1 



<210> 2 
<211> 264 
<212> DNA 

<213> Yersinia pestis 
<220> 

<223> nucliec acids encoding YscF 
<220> 

<223> sequence can be found at MedLine accession number NL_004839 
<400> 2 

atgagtaact tctctggatt tacgaaagga accgatatcg cagacttaga tgcggtggct 60 
caaacgctca agaagccagc agacgatgca aacaaagcgg ttaatgactc gatagcagca 12 0 
ttgaaagata agcctgacaa cccggcgcta cttgctgact tacaacattc aattaataaa 180 
tggtcggtaa tttacaatat aaactcaacc atagttcgta gcatgaaaga cttaatgcaa 24 0 
ggcatcctac agaagttccc ataa 264 

<210> 3 
<211> 264 
<212> DNA 

<213> Yersinia enterocolitica 
<220> 

<223> nucleic acid encoding YscF from Y. enterocolitica 
<220> 

<223> sequence can be found at MedLine accession number NC_005017 
<400> 3 

atgagtaatt tctctgggtt tgcaaaagga accgatatca cagacttaga tgcggtggct 60 
caaacgctca agaagccagc agacggcgca aacaaagcgg ttaatgactc gatagcagcg 12 0 
ttgaaagaga cgcctgacaa cccggcgcta cttgctgact tacaacattc aattaataaa 18 0 
tggtcggtaa tttacaatat aagctcaacc atagttcgta gcatgaaaga cttaatgcaa 240 
ggcatcctac agaagttccc ataa 2 64 



<210> 4 
<211> 87 
<212> PRT 



2 



<213> Yersinia enterocolitica 



<220> 

<223> amino acid sequence of YscF from Y. enterocolitica 
<220> 

<223> sequence can be found at MedLine accession number NP_863538.1 
<400> 4 

Met Ser Asn Phe Ser Gly Phe Ala Lys Gly Thr Asp He Thr Asp Leu 
15 10 15 

Asp Ala Val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 

Ala Val Asn Asp Ser He Ala Ala Leu Lys Asp Thr Pro Asp Asn Pro 
35 40 45 

Ala Leu Leu Ala Asp Leu Gin His Ser He Asn Lys Trp Ser Val He 
50 55 60 

Tyr Asn Met Ser Ser Thr He Val Arg Ser Met Lys Asp Leu Met Gin 
65 70 75 80 

Gly He Leu Gin Lys Phe Pro 
85 



<210> 5 
<211> 80 
<212> PRT 

<213> Salmonella enterica 
<220> 

<223> amino acid sequence of PrgI 
<220> 

<223> sequence can be found at MedLine accession number CAD05980.1 
<400> 5 

Met Pro Thr Ser Trp Ser Gly Tyr Leu Asp Glu Val Ser Ala Lys Phe 
15 10 15 



3 



Asp Lys Gly Val Asp Asn Leu Gin Thr Gin Val Thr Glu Ala Leu Asp 
20 25 30 



Lys Leu Ala Ala Lys Pro Ser Asp Pro Ala Leu Leu Ala Ala Tyr Gin 
35 40 45 

Ser Lys Leu Ser Glu Tyr Asn Leu Tyr Arg Asn Ala Gin Ser Asn Thr 
50 55 60 

Val Lys Val Phe Lys Asp lie Asp Ala Ala He He Gin Asn Phe Arg 
65 70 75 80 



<210> 6 
<211> 243 
<212> DNA 

<213> Salmonella enterica 
<220> 

<223> sequence can be found at MedLine accession number AL627276 
<220> 

<223> nucleic acid encoding PrgI 
<400> 6 

atgccaacat cttggtcagg ctatctggat gaagtttcag caaaatttga taagggcgtt 60 
gataatctac aaacgcaggt aacagaggcg ctggataaat tagcagcaaa accctccgat 12 0 
ccggcgctac tggcggcgta tcagagtaag ctctcggaat ataacttgta ccgtaacgcg 180 
caatcgaaca cggtaaaagt ctttaaggat attgatgctg ccattattca gaacttccgt 24 0 
taa 243 

<210> 7 

<211> 83 

<212> PRT 

<213> Shigella flexneri 
<220> 

<223> amino acid sequence of MxiH 
<220> 

<223> sequence can be found at MedLine accession number NP_8582 70.1 

<400> 7 



4 



Met Ser Val Thr Val Pro Asn Asp Asp Trp Thr Leu Ser Ser Leu Ser 
15 10 15 



Glu Thr Phe Asp Asp Gly Thr Gin Thr Leu Gin Gly Glu Leu Thr Leu 
20 25 30 

Ala Leu Asp Lys Leu Ala Lys Asn Pro Ser Asn Pro Gin Leu Leu Ala 
35 40 45 



Glu Tyr Gin Ser Lys Leu Ser Glu Tyr Thr Leu Tyr Arg Asn Ala Gin 
50 55 60 



Ser Asn Thr Val Lys Val He Lys Asp Val Asp Ala Ala He He Gin 
65 70 75 80 

Asn Phe Arg 



<210> 8 
<211> 252 
<212> DNA 

<213> Shigella flexneri 
<220> 

<223> nucleic acid encoding MxiH 
<220> 

<223> sequence can be found at MedLine accession number Nc_004851 
<400> 8 

atgagtgtta cagtaccgaa tgatgattgg acattgagtt cattatctga aacttttgat 60 
gatggaactc aaacattaca aggtgaacta acattggcac tagataaatt agctaaaaat 12 0 
ccttcgaatc cacagttgct ggctgaatac caaagtaaat tatctgaata tacattatat 18 0 
aggaacgcgc aatccaatac agtgaaagtg attaaggatg ttgatgctgc aattattcaa 24 0 
aacttcagat aa 252 



<210> 9 
<211> 73 
<212> PRT 



5 



<213> Escherichia coli 



<220> 

<223> amino acid sequence of EscF 
<220> 

<223> sequence can be found at MedLine accession number NP_312579.1 
<400> 9 

Met Asn Leu Ser Glu lie Thr Gin Gin Met Gly Glu Val Gly Lys Thr 
15 10 15 

Leu Ser Asp Ser Val Pro Glu Leu Leu Asn Ser Thr Asp Leu Val Asn 
20 25 ' 30 

Asp Pro Glu Lys Met Leu Glu Leu Gin Phe Ala Val Gin Gin Tyr Ser 
35 40 45 

Ala Tyr Val Asn Val Glu Ser Gly Met Leu Lys Thr lie Lys Asp Leu 
50 55 60 

Val Ser Thr lie Ser Asn Arg Ser Phe 
65 70 



<210> 10 
<211> 222 
<212> DNA 

<213> Escherichia coli 
<220> 

<223> nucleic acid encoding EscF 
<220> 

<223> sequence can be found at MedLine accession number NC_002695 
<400> 10 

atgaatttat ctgaaattac tcaacaaatg ggtgaagtag gtaaaacgct gagcgattct 60 
gtgccagagt tacttaatag caccgatttg gttaatgacc ctgaaaaaat gttagagttg 12 0 
cagtttgcgg ttcagcaata ttctgcttat gttaacgtag aaagtggaat gttgaaaacg 180 
ataaaagatc tggtctcaac catttctaac cgtagttttt aa 222 



6 



<210> 11 

<211> 240 

<212> DNA 

<213> Yersinia pestis 
<220> 

<223> nucleic acid encoding HT-YscF; an example of a recombinant YscF 
<220> 

< 2 2 1 > mi sc_f ea t ure 

<222> (214) . . (237) 

<223> His-tag sequence 

<400> 11 

atgagtaact tctctggatt tacgaaagga accgatgatg caaacaaagc ggttaatgac 60 

tcgatagcag cattgaaaga taagcctgac aacccggcgc tacttgctga cttacaacat 12 0 

tcaattaata aatggtcggt aatttacaat ataaactcaa ccatagttcg tagcatgaaa 180 

gacttaatgc aaggcatcct acagaagttc ccactcgagc accaccacca ccaccactga 24 0 

<210> 12 

<211> 95 

<212> PRT 

<213> Yersinia pestis 
<220> 

<223> amino acid sequence of HT-YscF; an example of a recombinant YscF 
<220> 

<221> misc_feature 

<222> (88) . . (95) 

<223> His-tag 

<400> 12 

Met Ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp lie Ala Asp Leu 
15 10 15 



Asp Ala Val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 



Ala Val Asn Asp Ser lie Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro 
35 40 45 



7 



Ala Leu Leu Ala Asp Leu Gin His Ser lie Asn Lys Trp Ser Val lie 
50 55 60 



Tyr Asn lie Asn Ser Thr He Val Arg Ser Met Lys Asp Leu Met Gin 
65 70 75 80 



Gly He Leu Gin Lys Phe Pro Leu Glu His His His His His His 
85 90 95 



<210> 13 

<211> 5537 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pMJ119 
<220> 

<221> rnisc__f eature 

<222> (563) . . (1375) 

<223> kan encoded on complementary strand 
<220> 

<221> misc_feature 

<222> (3518) . . (4597) 

<223> LacI encoded on complementary strand 
<220> 

<221> promoter 

<222> (4983) . . (5000) 

<223> T7 promoter 

<220> 

<221> misc_f eature 

<222> (5001) . . (5001) 

<223> T7 transcription start 

<220> 

< 2 2 1 > mi sc_f ea t ure 

<222> (5072) . . (5104) 

<223> T7 Tag 

<220> 

<221> CDS 

<222> (5114) . . (5401) 

<223> HT-YscF 

<220> 



8 



< 2 2 1 > mi s c_f ea ture 
<222> (5381) . . (5398) 
<223> His-tag sequence 

<220> 

<221> terminator 
<222> (5466) . . (5512) 
<223> T7 terminator 

<400> 13 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 

cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 12 0 

ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 

gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 24 0 

acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 3 00 

ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 

ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 

acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 

tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 54 0 

tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600 

tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660 

actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 72 0 

gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780 

aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840 

agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900 

cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 

aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020 

tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 

tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 114 0 

taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 

ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260 
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tcgcacctga 


ttgcccgaca 


ttatcgcgag 


cccatttata cccatataaa 


tcagcatcca 


1320 


tgttggaatt 


taatcgcggc 


ctagagcaag acgtttcccg ttgaatatgg ctcataacac 


1380 


cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 


1440 


cgtgagtttt 


cgttccactg 


agcgtcagac 


cccgtagaaa agatcaaagg 


atcttcttga 


1500 


gatccttttt 


ttctgcgcgt 


aatctgctgc 


ttgcaaacaa aaaaaccacc 


gctaccagcg 


1560 


gtggtttgtt 


tgccggatca 


agagctacca 


actctttttc cgaaggtaac 


tggcttcagc 


1620 


agagcgcaga 


taccaaatac 


tgtccttcta 


gtgtagccgt agttaggcca 


ccacttcaag 


1680 


aactctgtag 


caccgcctac 


atacctcgct 


ctgctaatcc' tgttaccagt 


ggctgctgcc 


1740 


agtggcgata 


agtcgtgtct 


taccgggttg 


gactcaagac gatagttacc 


ggataaggcg 


1800 


cagcggtcgg gctgaacggg gggttcgtgc 


acacagccca gcttggagcg 


aacgacctac 


1860 


accgaactga 


gatacctaca 


gcgtgagcta 


tgagaaagcg ccacgcttcc 


cgaagggaga 


1920 


aaggcggaca ggtatccggt 


aagcggcagg gtcggaacag gagagcgcac 


gagggagctt 


1980 


ccagggggaa 


acgcctggta 


tctttatagt 


cctgtcgggt ttcgccacct 


ctgacttgag 


2040 


cgtcgatttt 


tgtgatgctc 


gtcagggggg 


cggagcctat ggaaaaacgc 


cagcaacgcg 


2100 


gcctttttac 


ggttcctggc 


cttttgctgg 


ccttttgctc acatgttctt 


tcctgcgtta 


2160 


tcccctgatt 


ctgtggataa 


ccgtattacc 


gcctttgagt gagctgatac 


cgctcgccgc 


2220 


agccgaacga 


ccgagcgcag 


cgagtcagtg 


agcgaggaag cggaagagcg 


cctgatgcgg 


2280 


tattttctcc 


ttacgcatct 


gtgcggtatt 


tcacaccgca tatatggtgc 


actctcagta 


2340 


caatctgctc 


tgatgccgca 


tagttaagcc 


agtatacact ccgctatcgc 


tacgtgactg 


2400 


ggtcatggct 


gcgccccgac 


acccgccaac 


acccgctgac gcgccctgac 


gggcttgtct 


2460 


gctcccggca 


tccgcttaca 


gacaagctgt 


gaccgtctcc gggagctgca 


tgtgtcagag 


2520 


gttttcaccg 


tcatcaccga 


aacgcgcgag 


gcagctgcgg taaagctcat 


cagcgtggtc 


2580 


gtgaagcgat 


tcacagatgt 


ctgcctgttc 


atccgcgtcc agctcgttga 


gtttctccag 


2640 


aagcgttaat 


gtctggcttc 


tgataaagcg 


ggccatgtta agggcggttt 


tttcctgttt 


2700 


ggtcactgat 


gcctccgtgt 


aagggggatt 


tctgttcatg ggggtaatga 


taccgatgaa 


2760 
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acgagagagg atgctcacga 


tacgggttac 


tgatgatgaa catgcccggt tactggaacg 


2820 


ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 


2880 


tcaatgccag 


cgcttcgtta atacagatgt 


aggtgttcca 


cagggtagee agcagcatcc 


2940 


tgcgatgcag atccggaaca 


taatggtgca gggcgctgac 


ttccgcgttt ccagacttta 


3000 


cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 


3060 


qcaqtcactt 


cacgttcgct 


ccrccrtatccrcr 


fccrafc teat t~ r* 


u.y \~. L-aaL. t-cty uctcty y OaaLL 


3120 


ccgccagcct 


^cfccgggtcc 


tcaaccracacr 


y >-»CT.^»y d lv 


aL, y^'y ^"■^^^ y ^yyyy*--^— y^ 


3180 


catcrccaaccr 


ataatggcct 


acttctcacc 

\j ^* w w Vm» 


craaaccrt t~ t~ rr 


y L yy t -yyy« c cayugacgaa 


3240 


ggcttgagcg 


agggcgtgca 


agattccgaa 


taccgcaagc 


gaeaggcega tcatcgtcgc 


3300 


gctccagcga 


aagcggtcct 


cgccgaaaat 


gacccagagc 


gctgccggca cctgtcctac 


3360 


gagttgcatg 


ataaagaaga 


cagtcataag 


tgcggcgacg 


atagtcatgc cccgcgccca 


3420 


ccggaaggag 


ctgactgggt 


tgaaggctct 


caagggcatc 


ggtcgagatc ccggtgccta 


3480 


atgagtgagc 


taacttacat 


taattgcgtt 


gcgctcactg 


cccgctttcc agtcgggaaa 


3540 


cctgtcgtgc 


cagctgcatt 


aatgaatcgg 


ccaacgcgcg 


gggagaggcg gtttgegtat 


3600 


tgggcgccag 


ggtggttttt 


cttttcacca 


gtgagacggg 


caacagctga ttgcccttca 


3660 


ccgcctggcc 


ctgagagagt 


tgcagcaagc 


ggtccacgct 


ggtttgcccc ageaggegaa 


3720 


aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 


3780 


atcccactac 


cgagatatcc 


gcaccaacgc 


gcagcccgga 


cteggtaatg gcgcgcattg 


3840 


cgcccagcgc 


catctgatcg 


ttggcaacca 


geategcagt 


gggaacgatg ccctcattca 


3900 


gcatttgcat 


ggtttgttga 


aaaccggaca 


tggcactcca 


gtcgccttcc cgttccgcta 


3960 


tcggctgaat 


ttgattgcga 


gtgagatatt 


tatgccagcc 


agccagacgc agacgcgccg 


4020 


agacagaact 


taatgggccc 


gctaacagcg 


egatttgetg 


gtgacccaat gcgaccagat 


4080 


gctccacgcc 


cagtcgcgta 


ccgtcttcat 


gggagaaaat 


aatactgttg atgggtgtct 


4140 


ggtcagagac 


atcaagaaat 


aacgccggaa 


cattagtgea 


ggcagcttcc acagcaatgg 


4200 


catcctggtc 


atccagcgga 


tagttaatga 


tcagcccact 


gacgcgttgc gcgagaagat 


4260 


tgtgcaccgc 


cgctttacag gcttcgacgc 


cgcttcgttc 


taccatcgac accaccacgc 


4320 
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tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4 3 80 

gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 444 0 

ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500 

tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4 560 

catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4 62 0 

cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680 

tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740 

ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4 800 

ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860 

cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4 92 0 

gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga 4 980 

aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 504 0 

ttttgtttaa ctttaagaag gagatataca tatggctagc atgactggtg gacagcaaat 5100 

gggtcgggat ccg atg agt aac ttc tct gga ttt acg aaa gga acc gat 514 9 
Met Ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp 
15 10 

ate gca gac tta gat gcg gtg get caa acg etc aag aag cca gca gac 5197 
lie Ala Asp Leu Asp Ala Val Ala Gin Thr Leu Lys Lys Pro Ala Asp 
15 20 25 

gat gca aac aaa gcg gtt aat gac teg ata gca gca ttg aaa gat aag 5245 
Asp Ala Asn Lys Ala Val Asn Asp Ser lie Ala Ala Leu Lys Asp Lys 
30 35 40 

cct gac aac ccg gcg eta ctt get gac tta caa cat tea att aat aaa 52 93 
Pro Asp Asn Pro Ala Leu Leu Ala Asp Leu Gin His Ser He Asn Lys 
45 50 55 60 

tgg teg gta att tac aat ata aac tea acc ata gtt cgt age atg aaa 5341 
Trp Ser Val He Tyr Asn He Asn Ser Thr He Val Arg Ser Met Lys 
65 70 75 

gac tta atg caa ggc ate eta cag aag ttc cca etc gag cac cac cac 5389 
Asp Leu Met Gin Gly He Leu Gin Lys Phe Pro Leu Glu His His His 
80 85 90 
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cac cac cac tga gatccggctg ctaacaaagc ccgaaaggaa gctgagttgg 
His His His 
95 



5441 



ctgctgccac cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga 5501 
ggggtttttt gctgaaagga ggaactatat ccggat 553 7 



<210> 14 
<211> 5309 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pET24b 
<220> 

<221> terminator 
<222> (26).. (72) 

<223> T7 terminator 

<220> 

<221> misc_feature 
<222> (140) . . (157) 

<223> His-tag sequence 

<220> 

<221> misc_feature 
<222> (206) . . (238) 

<223> T7 Tag 

<220> 

< 2 2 1 > mi sc_f ea tur e 
<222> (309) . . (309) 

<220> 

<221> misc_f eature 

<222> (309) . . (309) 

<223> T7 transcription start 

<220> 

<221> promoter 

<222> (310) . . (327) 

<223> T7 promoter 

<220> 

< 2 2 1 > mis c_f eature 
<222> (713) . . (716) 

<223> Alternate start codon H gtg" 
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<220> 

<221> CDS 

<222> (713) . . (1792) 

<223> lad 

<220> 

<221> CDS 

<222> (3935) . . (4747) 

<223> kan 

<400> 14 

atccggatat agttcctcct ttcagcaaaa aacccctcaa gacccgttta gaggccccaa 60 

ggggttatgc tagttattgc tcagcggtgg cagcagccaa ctcagcttcc tttcgggctt 12 0 

tgttagcagc cggatctcag tggtggtggt ggt'ggtgctc gagtgcggcc gcaagcttgt 180 

cgacggagct cgaattcgga tcccgaccca tttgctgtcc accagtcatg ctagccatat 24 0 

gtatatctcc ttcttaaagt taaacaaaat tatttctaga ggggaattgt tatccgctca 300 

caattcccct atagtgagtc gtattaattt cgcgggatcg agatctcgat cctctacgcc 360 

ggacgcatcg tggccggcat caccggcgcc acaggtgcgg ttgctggcgc ctatatcgcc 42 0 

gacatcaccg atggggaaga tcgggctcgc cacttcgggc tcatgagcgc ttgtttcggc 4 80 

gtgggtatgg tggcaggccc cgtggccggg ggactgttgg gcgccatctc cttgcatgca 54 0 

ccattccttg cggcggcggt gctcaacggc ctcaacctac tactgggctg cttcctaatg 600 

caggagtcgc ataagggaga gcgtcgagat cccggacacc atcgaatggc gcaaaacctt 660 

tcgcggtatg gcatgatagc gcccggaaga gagtcaattc agggtggtga at gtg aaa 718 

Met Lys 
1 

cca gta acg tta tac gat gtc gca gag tat gcc ggt gtc tct tat cag 766 
Pro Val Thr Leu Tyr Asp Val Ala Glu Tyr Ala Gly Val Ser Tyr Gin 
5 10 15 

acc gtt tec cgc gtg gtg aac cag gcc age cac gtt tct gcg aaa acg 814 
Thr Val Ser Arg Val Val Asn Gin Ala Ser His Val Ser Ala Lys Thr 
20 25 30 

egg gaa aaa gtg gaa gcg gcg atg gcg gag ctg aat tac att ccc aac 862 
Arg Glu Lys Val Glu Ala Ala Met Ala Glu Leu Asn Tyr lie Pro Asn 
35 40 45 50 

cgc gtg gca caa caa ctg gcg ggc aaa cag teg ttg ctg att ggc gtt 910 
Arg Val Ala Gin Gin Leu Ala Gly Lys Gin Ser Leu Leu lie Gly Val 
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55 



60 



65 



gcc acc tec agt ctg gec ctg cac gcg ccg teg caa att gtc gcg gcg 958 
Ala Thr Ser Ser Leu Ala Leu His Ala Pro Ser Gin lie Val Ala Ala 
70 75 80 

att aaa tct cgc gcc gat caa ctg ggt gcc age gtg gtg gtg teg atg 1006 
lie Lys Ser Arg Ala Asp Gin Leu Gly Ala Ser Val Val Val Ser Met 
85 90 95 

gta gaa cga age ggc gtc gaa gcc tgt aaa gcg gcg gtg cac aat ctt 1054 
Val Glu Arg Ser Gly Val Glu Ala Cys Lys Ala Ala Val His Asn Leu 
100 105 110 

etc gcg caa cgc gtc agt ggg ctg ate att aac tat ccg ctg gat gac 1102 
Leu Ala Gin Arg Val Ser Gly Leu lie lie Asn Tyr Pro Leu Asp Asp 
115 120 125 130 

cag gat gcc att get gtg gaa get gcc tgc act aat gtt ccg gcg tta 1150 
Gin Asp Ala lie Ala Val Glu Ala Ala Cys Thr Asn Val Pro Ala Leu 
135 140 145 

ttt ctt gat gtc tct gac cag aca ccc ate aac agt att att ttc tec 1198 
Phe Leu Asp Val Ser Asp Gin Thr Pro lie Asn Ser lie lie Phe Ser 
150 155 160 

cat gaa gac ggt acg cga ctg ggc gtg gag cat ctg gtc gca ttg ggt 124 6 

His Glu Asp Gly Thr Arg Leu Gly Val Glu His Leu Val Ala Leu Gly 
165 170 175 

cac cag caa ate gcg ctg tta gcg ggc cca tta agt tct gtc teg gcg 12 94 

His Gin Gin lie Ala Leu Leu Ala Gly Pro Leu Ser Ser Val Ser Ala 
180 185 190 

cgt ctg cgt ctg get ggc tgg cat aaa tat etc act cgc aat caa att 1342 
Arg Leu Arg Leu Ala Gly Trp His Lys Tyr Leu Thr Arg Asn Gin lie 
195 200 205 210 

cag ccg ata gcg gaa egg gaa ggc gac tgg agt gcc atg tec ggt ttt 13 90 

Gin Pro lie Ala Glu Arg Glu Gly Asp Trp Ser Ala Met Ser Gly Phe 
215 220 225 

caa caa acc atg caa atg ctg aat gag ggc ate gtt ccc act gcg atg 14 3 8 

Gin Gin Thr Met Gin Met Leu Asn Glu Gly lie Val Pro Thr Ala Met 
230 235 240 

ctg gtt gcc aac gat cag atg gcg ctg ggc gca atg cgc gcc att acc 14 86 

Leu Val Ala Asn Asp Gin Met Ala Leu Gly Ala Met Arg Ala lie Thr 
245 250 255 

gag tec ggg ctg cgc gtt ggt gcg gat ate teg gta gtg gga tac gac 1534 
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Glu Ser Gly Leu Arg Val Gly Ala Asp lie Ser Val Val Gly Tyr Asp 
260 265 270 



gat acc gaa gac age tea tgt tat ate ccg ccg tta ace ace ate aaa 1582 
Asp Thr Glu Asp Ser Ser Cys Tyr He Pro Pro Leu Thr Thr He Lys 
275 280 285 290 

cag gat ttt cgc ctg ctg ggg caa acc age gtg gac cgc ttg ctg caa 163 0 

Gin Asp Phe Arg Leu Leu Gly Gin Thr Ser Val Asp Arg Leu Leu Gin 
295 300 305 

etc tct cag ggc cag gcg gtg aag ggc aat cag ctg ttg ccc gtc tea 1678 
Leu Ser Gin Gly Gin Ala Val Lys Gly Asn Gin Leu Leu Pro Val Ser 
310 315 320 

ctg gtg aaa aga aaa acc acc ctg gcg ccc aat acg caa acc gee tct 1726 
Leu Val Lys Arg Lys Thr Thr Leu Ala Pro Asn Thr Gin Thr Ala Ser 
325 330 335 

ccc cgc gcg ttg gee gat tea tta atg cag ctg gca cga cag gtt tec 1774 
Pro Arg Ala Leu Ala Asp Ser Leu Met Gin Leu Ala Arg Gin Val Ser 
340 345 350 

cga ctg gaa age ggg cag tgagegcaac gcaattaatg taagttagct 1822 
Arg Leu Glu Ser Gly Gin 
355 360 



cactcattag 


geacegggat 


ctcgaccgat 


geccttgaga 


gccttcaacc 


cagtcagctc 


1882 


cttccggtgg 


gegeggggea 


tgactatcgt 


cgccgcactt 


atgactgtct 


tctttatcat 


1942 


geaactegta 


ggacaggtgc 


cggcagcgct 


ctgggtcatt 


tteggegagg 


accgctttcg 


2002 


ctggagcgcg 


acgatgatcg 


gcctgtcgct 


tgeggtatte 


ggaatcttgc 


acgccctcgc 


2062 


tcaagccttc 


gtcactggtc 


ccgccaccaa 


aegtttegge 


gagaagcagg 


ccattatcgc 


2122 


cggcatggcg 


gccccacggg 


tgegcatgat 


cgtgctcctg 


tcgttgagga 


cccggctagg 


2182 


ctggcggggt 


tgccttactg 


gttagcagaa 


tgaatcaccg 


atacgegage 


gaacgtgaag 


2242 


cgactgctgc 


tgcaaaacgt 


ctgcgacctg 


agcaacaaca 


tgaatggtct 


tcggtttccg 


2302 


tgtttcgtaa 


agtctggaaa 


cgcggaagtc 


agcgccctgc 


accattatgt 


teeggatctg 


2362 


categcagga 


tgctgctggc 


taccctgtgg 


aacacctaca 


tctgtattaa 


egaagegctg 


2422 


gcattgaccc 


tgagtgattt 


ttctctggtc 


ccgccgcatc 


cataccgcca 


gttgtttacc 


2482 


ctcacaacgt 


tccagtaacc 


gggcatgttc 


atcatcagta 


acccgtatcg 


tgagcatcct 


2542 
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ctctcgtttc 


atcggtatca 


ttacccccat 


gaacagaaat 


cccccttaca 


eggaggcate 


2602 


agtgaccaaa 


caggaaaaaa 


ccgcccttaa 


catggcccgc 


tttatcagaa gecagacatt 


2662 


aacgcttctg gagaaactca 


acgagctgga 


cgcggatgaa 


caggcagaca 


tctgtgaatc 


2722 


gcttcacgac cacgctgatg agctttaccg cagctgcctc gcgcgtttcg gtgatgacgg 


2782 


tgaaaacctc 


tgacacatgc 


agctcccgga 


gaeggtcaca 


gcttgtctgt 


aaqcqqatqc 


2842 


cgggagcaga 


caagcccgtc 


agggegegtc 


agcgggtgtt 

ZJ ZJ ZJ ZJ ZJ 


qqcqqqtqtc 

zj zj zj zj zj zj 


qqqqcQcaqc 

ZJZJZJZJ^ZJ^^ZJ^ 


2902 


catgacccag 


tcacgtagcg 


atageggagt 


gtatactggc 


ttaactatgc 


qqcatcaaacr 


2962 


cagattgtac 


tgagagtgca 


ecatatatge 


ggtgtgaaat 


accgcacaga 


tcrcataacraa 

Zj zj zj 


3022 


gaaaataccg 


catcaggcgc 


tcttccgctt 


cctcgctcac 


tqactccrctcr 


CQCtCQQtca 


3082 


ttcggctgcg 


gcgagcggta 


tcagctcact 


caaaggeggt 


aatacggtta 


tccacagaat 


3142 


caggggataa 


cgcaggaaag 


aacatgtgag 


caaaaggeca 


qcaaaaqqcc 


aggaacegta 


3202 


aaaaggccgc 


gttgctggcg 


tttttccata 


ggctccgccc 


ccctgacgag 


catcacaaaa 


3262 


atcgacgctc 


aagtcagagg 


tggegaaace 


cgacaggact 


ataaagatac 


caggegttte 


3322 


cccctggaag 


ctccctcgtg 


cgctctcctg 


ttccgaccct 


gccgcttacc 


qqatacctqt 


3382 


ccgcctttct 


cccttcggga 


agcgtggcgc 


tttctcatag 


ctcacgctgt 


aggtatctca 


3442 


gttcggtgta 


ggtcgttcgc 


tccaagctgg 


gctgtgtgca 


cgaacccccc 


qttcaqcccq 


3502 


accgctgcgc 


cttatccggt 


aactategtc 


ttgagtccaa 


cceggtaaga 


cacgacttat 


3562 


cgccactggc 


agcagccact 


ggtaacagga 


ttagcagagc 


gaggtatgta 


ggcggtgcta 


3622 


cagagttctt 


gaagtggtgg 


cctaactacg 


gctacactag 


aaggacagta 


tttggtatct 


3682 


gcgctctgct 


gaagccagtt 


acctteggaa 


aaagagttgg 


tagctcttga 


tccggcaaac 


3742 


aaaccaccgc 


tggtagcggt 


ggtttttttg 


tttgeaagea 


gcagattacg 


cgcagaaaaa 


3802 


aaggatctca 


agaagatcct 


ttgatctttt 


etaeggggtc 


tgacgctcag 


tggaacgaaa 


3862 


actcacgtta 


agggattttg 


gtcatgaaca 


ataaaactgt 


ctgettacat 


aaacagtaat 


3922 


acaaggggtg 


tt atg age 
Met Ser 


cat att caa egg gaa acg tct tgc tct agg ccg 
His lie Gin Arg Glu Thr Ser Cys Ser Arg Pro 


3973 



365 370 
cga tta aat tec aac atg gat get gat tta tat ggg tat aaa tgg get 4 021 
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Arg Leu Asn Ser Asn Met Asp Ala Asp Leu Tyr Gly Tyr Lys Trp Ala 
375 380 385 

cgc gat aat gtc ggg caa tea ggt gcg aca ate tat cga ttg tat ggg 4069 
Arg Asp Asn Val Gly Gin Ser Gly Ala Thr He Tyr Arg Leu Tyr Gly 
390 395 400 405 

aag ccc gat gcg cca gag ttg ttt ctg aaa cat ggc aaa ggt age gtt 4117 
Lys Pro Asp Ala Pro Glu Leu Phe Leu Lys His Gly Lys Gly Ser Val 
410 415 420 

gee aat gat gtt aca gat gag atg gtc aga eta aac tgg ctg acg gaa 4165 
Ala Asn Asp Val Thr Asp Glu Met Val Arg Leu Asn Trp Leu Thr Glu 
425 430 435 

ttt atg Get ctt ccg ace ate aag cat ttt ate cgt act cct gat gat 4213 
Phe Met Pro Leu Pro Thr He Lys His Phe He Arg Thr Pro Asp Asp 
440 445 450 

gca tgg tta etc acc act gcg ate ccc ggg aaa aca gca ttc cag gta 4261 
Ala Trp Leu Leu Thr Thr Ala He Pro Gly Lys Thr Ala Phe Gin Val 
455 460 465 

tta gaa gaa tat cct gat tea ggt gaa aat att gtt gat gcg ctg gca 43 09 

Leu Glu Glu Tyr Pro Asp Ser Gly Glu Asn He Val Asp Ala Leu Ala 
470 475 480 " 485 

gtg ttc ctg cgc egg ttg cat teg att cct gtt tgt aat tgt cct ttt 4357 
Val Phe Leu Arg Arg Leu His Ser He Pro Val Cys Asn Cys Pro Phe 
490 495 500 

aac age gat cgc gta ttt cgt etc get cag gcg caa tea cga atg aat 44 05 

Asn Ser Asp Arg Val Phe Arg Leu Ala Gin Ala Gin Ser Arg Met Asn 
505 510 515 

aac ggt ttg gtt gat gcg agt gat ttt gat gac gag cgt aat ggc tgg 4453 
Asn Gly Leu Val Asp Ala Ser Asp Phe Asp Asp Glu Arg Asn Gly Trp 
520 525 530 

cct gtt gaa caa gtc tgg aaa gaa atg cat aaa ctt ttg cca ttc tea 4501 
Pro Val Glu Gin Val Trp Lys Glu Met His Lys Leu Leu Pro Phe Ser 
535 540 545 

ccg gat tea gtc gtc act cat ggt gat ttc tea ctt gat aac ctt att 4549 
Pro Asp Ser Val Val Thr His Gly Asp Phe Ser Leu Asp Asn Leu He 
550 555 560 *" 565 

ttt gac gag ggg aaa tta ata ggt tgt att gat gtt gga cga gtc gga 4597 
Phe Asp Glu Gly Lys Leu He Gly Cys He Asp Val Gly Arg Val Gly 
570 575 580 
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ate gca gac cga tac cag gat ctt gec ate eta tgg aac tgc etc ggt 4645 
lie Ala Asp Arg Tyr Gin Asp Leu Ala lie Leu Trp Asn Cys Leu Gly 
585 590 595 

gag ttt tct cct tea tta cag aaa egg ctt ttt caa aaa tat ggt att 4693 
Glu Phe Ser Pro Ser Leu Gin Lys Arg Leu Phe Gin Lys Tyr Gly lie 
600 605 610 

gat aat cct gat atg aat aaa ttg cag ttt cat ttg atg etc gat gag 4 741 

Asp Asn Pro Asp Met Asn Lys Leu Gin Phe His Leu Met Leu Asp Glu 
615 620 625 

ttt ttc taagaattaa ttcatgagcg gatacatatt tgaatgtatt tagaaaaata 
Phe Phe 
630 

aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgaaatt gtaaacgtta 
atattttgtt aaaattcgcg ttaaattttt gttaaatcag ctcatttttt aaccaatagg 
ccgaaategg caaaatccct tataaatcaa aagaatagac cgagataggg ttgagtgttg 
ttccagtttg gaacaagagt ccactattaa agaacgtgga ctccaacgtc aaagggcgaa 
aaacegtcta teagggegat ggcccactac gtgaaccatc accctaatca agttttttgg 
ggtcgaggtg cegtaaagea etaaategga accctaaagg gagcccccga tttagagctt 
gaeggggaaa gccggcgaac gtggcgagaa aggaagggaa gaaagcgaaa ggagcgggcg 
etagggeget ggcaagtgta gcggtcacgc tgegegtaac caccacaccc gccgcgctta 
atgcgccgct acagggegeg tcccattcgc ca 



<210> 15 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer, HT-YscF Start 

<400> 15 

ccggatccga tgagtaactt ctctggattt 30 

<210> 16 

<211> 30 

<212> DNA 

<213> Artificial Sequence 



4797 

4857 
4917 
4977 
5037 
5097 
5157 
5217 
5277 
5309 
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<220> 

<223> primer, HT-YscF Stop 



<400> 16 

ccgctcgagt gggaacttct gtaggatgcc 3 0 



<210> 


17 




<211> 


87 




<212> 


PRT 




<213> 


Yersinia pestis KIM5 




<220> 






<223> 


YscF amino acid seuquence from Y, 


, pestis KIMS 


<220> 






<223> 


sequence can be found at MedLine 


accession number 


<400> 


17 





Met Ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp He Ala Asp Leu 
15 10 15 



Asp Ala Val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 



Ala Val Asn Asp Ser He Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro 
35 40 45 



Ala Leu Leu Ala Asp Leu Gin His Ser He Asn Lys Trp Ser Val He 
50 55 60 



Tyr Asn He Asn Ser Thr He Val Arg Ser Met Lys Asp Leu Met Gin 
65 70 75 80 



Gly He Leu Gin Lys Phe Pro 
85 



<210> 18 

<211> 87 

<212> PRT 

<213> Yersinia pestis C092 
<220> 
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<223> YscF amino acid sequence from Y. pestis C092 
<220> 

<223> sequence can be found at MedLine accession number NC_003131 
<400> 18 

Met Ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp lie Ala Asp Leu 
15 10 15 



Asp Ala Val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 



Ala Val Asn Asp Ser lie Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro 
35 40 45 



Ala Leu Leu Ala Asp Leu Gin His Ser lie Asn Lys Trp Ser Val lie 
50 55 60 



Tyr Asn lie Asn Ser Thr lie Val Arg Ser Met Lys Asp Leu Met Gin 
65 70 75 80 



Gly lie Leu Gin Lys Phe Pro 
85 



<210> 19 
<211> 87 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Consensus sequence for YscF 
<400> 19 

Met Ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp lie Ala Asp Leu 
15 10 15 



Asp Ala Val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 



Ala Val Asn Asp Ser lie Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro 
35 40 45 



21 



Ala Leu Leu Ala Asp Leu Gin His Ser lie Asn Lys Trp Ser Val lie 
50 55 60 



Tyr Asn lie Asn Ser Thr lie Val Arg Ser Met Lys Asp Leu Met Gin 
65 70 75 80 



Gly lie Leu Gin Lys Phe Pro 
85 



<210> 20 

<211> 4884 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pBAD18-YscF 



<220> 

<221> misc_f eature 
<222> (96) . . (974) 

<223> araC 

<220> 

<221> misc_f eature 
<222> (1003) . . (1020) 

<223> operator 02 

<220> 

<221> promoter 

<222> (1125) . . (1153) 

<223> Pc promoter 

<220> 

<221> misc_f eature 

<222> (1161) . . (1182) 

<223> operator 01 

<220> 

<221> misc_f eature 

<222> (1204) . . (1217) 

<223> CAP site 

<220> 

<221> misc_feature 

<222> (1213) . . (1251) 
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<223> operator 12 + II 



<220> 

<221> promoter 

<222> (1250) . . (1277) 

<223> PBAD promoter 

<220> 

<221> CDS 

<222> (1323) . . (1586) 

<223> YscF 

<220> 

<221> misc_f eature 

<222> (1634) . . (2059) 

<223>' rrnB 

<220> 

<221> terminator 

<222> (1634) . . (2059) 

<223> rrnB 

<220> 

<221> promoter 

<222> (2114) . . (2120) 

<223> bla P3 promoter 

<220> 

<221> misc_f eature 

<222> (2152) . . (3015) 

<223> bla 

<220> 

<221> rep_origin 

<222> (3051) . . (3509) 

<223> M13 origin 

<220> 

<221> rep_origin 

<222> (3515) . . (4212) 

<223> pBR322 origin 

<400> 20 

atcgatgcat aatgtgcctg tcaaatggac gaagcaggga ttctgcaaac cctatgctac 60 

tccgtcaagc cgtcaattgt ctgattcgtt accaattatg acaacttgac ggctacatca 120 

ttcacttttt cttcacaacc ggcacggaac tcgctcgggc tggccccggt gcatttttta 180 

aatacccgcg agaaatagag ttgatcgtca aaaccaacat tgcgaccgac ggtggcgata 24 0 
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ggcatccggg tggtgctcaa aagcagcttc gcctggctga tacgttggtc ctcgcgccag 3 00 

cttaagacgc taatccctaa ctgctggcgg aaaagatgtg acagacgcga cggcgacaag 3 60 

caaacatgct gtgcgacgct ggcgatatca aaattgctgt ctgccaggtg atcgctgatg 420 

tactgacaag cctcgcgtac ccgattatcc atcggtggat ggagcgactc gttaatcgct 480 

tccatgcgcc gcagtaacaa ttgctcaagc agatttatcg ccagcagctc cgaatagcgc 540 

ccttcccctt gcccggcgtt aatgatttgc ccaaacaggt cgctgaaatg cggctggtgc 600 

gcttcatccg ggcgaaagaa ccccgtattg gcaaatattg acggccagtt aagccattca 660 

tgccagtagg cgcgcggacg aaagtaaacc cactggtgat accattcgcg agcctccgga 72 0 

tgacgaccgt agtgatgaat ctctcctggc gggaacagca aaatatcacc cggtcggcaa 780 

acaaattctc gtccctgatt tttcaccacc ccctgaccgc gaatggtgag attgagaata 84 0 

taacctttca ttcccagcgg tcggtcgata aaaaaatcga gataaccgtt ggcctcaatc 900 

ggcgttaaac ccgccaccag atgggcatta aacgagtatc ccggcagcag gggatcattt 960 

tgcgcttcag ccatactttt catactcccg ccattcagag aagaaaccaa ttgtccatat 1020 

tgcatcagac attgccgtca ctgcgtcttt tactggctct tctcgctaac caaaccggta 1080 

accccgctta ttaaaagcat tctgtaacaa agcgggacca aagccatgac aaaaacgcgt 114 0 

aacaaaagtg tctataatca cggcagaaaa gtccacattg attatttgca cggcgtcaca 12 00 

ctttgctatg ccatagcatt tttatccata agattagcgg atcctacctg acgcttttta 1260 

tcgcaactct ctactgtttc tccatacccg tttttttggg ctagcgaatt caggaggaaa 132 0 

eg atg agt aac ttc tct gga ttt acg aaa gga acc gat ate gca gac 13 67 
Met Ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp lie Ala Asp 
15 10 15 

tta gat gcg gtg get caa acg etc aag aag cca gca gac gat gca aac 1415 
Leu Asp Ala Val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn 
20 25 30 

aaa gcg gtt aat gac teg ata gca gca ttg aaa gat aag cct gac aac 14 63 
Lys Ala Val Asn Asp Ser lie Ala Ala Leu Lys Asp Lys Pro Asp Asn 
35 40 45 

ccg gcg eta ctt get gac tta caa cat tea att aat aaa tgg teg gta 1511 
Pro Ala Leu Leu Ala Asp Leu Gin His Ser lie Asn Lys Trp Ser Val 
50 55 60 
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att tac aat ata aac tea acc ata gtt cgt age atg aaa gac tta atg 1559 
lie Tyr Asn lie Asn Ser Thr lie Val Arg Ser Met Lys Asp Leu Met 
65 70 75 

caa ggc ate eta cag aag ttc cca taa ggatcccgcg gggatcctct 1606 
Gin Gly lie Leu Gin Lys Phe Pro 
80 85 

agagtcgacc tgcaggcatg caagcttggc tgttttggcg gatgagagaa gattttcagc 1666 

ctgatacaga ttaaatcaga aegcagaage ggtctgataa aacagaattt gcctggcggc 172 6 

agtagegegg tggtcccacc tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc 1786 

gatggtagtg tggggtctcc ecatgegaga gtagggaact gccaggcatc aaataaaacg 1846 

aaaggctcag tcgaaagact gggecttteg ttttatctgt tgtttgtcgg tgaaegctet 1906 

cctgagtagg acaaatccgc egggagegga tttgaacgtt gcgaagcaac ggeceggagg 1966 

gtggcgggca ggacgcccgc cataaactgc caggcatcaa attaagcaga aggccatcct 2 026 

gaeggatgge etttttgegt ttctacaaac tcttttgttt atttttctaa atacattcaa 2086 

atatgtatcc gctcatgaga caataaccct gataaatget tcaataatat tgaaaaagga 2146 

agagtatgag tattcaacat ttccgtgtcg cccttattcc ettttttgeg geattttgee 2206 

ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgetgaa gatcagttgg 2266 

gtgcacgagt gggttacatc gaactggatc teaacagegg taagatcctt gagagttttc 2326 

gccccgaaga aegttttcca atgatgagca cttttaaagt tetgetatgt ggegeggtat 2386 

tatcccgtgt tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 244 6 

acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 2506 

aattatgcag tgetgecata accatgagtg ataacactgc ggecaactta cttctgacaa 2566 

egateggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 2626 

gecttgateg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 2686 

cgatgcctgc agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 2746 

tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 2806 

tgcgctcggc ccttccggct ggctggttta ttgctgataa atetggagee ggtgagcgtg 2866 
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ggtctcgcgg tatcattgca gcactggggc 


cagatggtaa gccctcccgt 


atcgtagtta 


2926 


tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc 


gctgagatag 


2986 


gtgcctcact 


gattaagcat 


tggtaactgt 


cagaccaagt ttactcatat 


atactttaga 


3046 


ttgatttacg 


cgccctgtag 


cggcgcatta 


agcgcggcqq qtqtqqtqqt 

— ' ZJ ZJ ZJ ZJ ZJ ZJ ZJ ZJZj ZJZJ 


tacgcgcagc 


3106 


gtgaccgcta 


cacttgccag 


cgccctagcg 


cccgctcctt tcgctttctt 


cccttccttt 


3166 


ctcgccacgt 


tcgccggctt 


tccccgtcaa 


gctctaaatc gggggctccc 

_ f Zj ZJ ZJ ZJ 


tttagggttc 


3226 


cgatttagtg 


ctttacggca 


cctcgacccc 


aaaaaacttg atttgggtga 


tggttcacgt 


3286 


agtgggccat 


cgccctgata 


gacggttttt 


cgccctttga cgttggagtc 


cacgttcttt 


3346 


aatagtggac 


tcttgttcca 


aacttgaaca 


acactcaacc ctatctcggg 


ctattctttt 


3406 


gatttataag 


ggattttgcc 


gatttcggcc 


tattggttaa aaaatgagct 


gatttaacaa 


3466 


aaatttaacg 


cgaattttaa 


caaaatatta 


acgtttacaa tttaaaagga 


tctaggtgaa 


3526 


gatccttttt 


gataatctca 


tgaccaaaat 


cccttaacgt gagttttcgt 


tccactgagc 


3586 


gtcagacccc 


gtagaaaaga 


tcaaaggatc 


ttcttgagat cctttttttc 


tgcgcgtaat 


3646 


ctgctgcttg 


caaacaaaaa 


aaccaccgct 


accagcggtg gtttgtttgc 


cggatcaaga 


3706 


gctaccaact 


ctttttccga 


aggtaactgg 


cttcagcaga gcgcagatac 


caaatactgt 


3766 


ccttctagtg 


tagccgtagt 


taggccacca 


cttcaagaac tctgtagcac 


cgcctacata 


3826 


cctcgctctg 


ctaatcctgt 


taccagtggc 


tgctgccagt ggcgataagt 


cgtgtcttac 


3886 


cgggttggac 


tcaagacgat 


agttaccgga 


taaggcgcag cggtcggqct 

— ' — ' -J ZJ ZJ ZJ ZJ ZJ ZJ 


gaacgggggg 


3946 


ttcgtgcaca 


cagcccagct 


tggagcgaac 


gacctacacc gaactgagat 


acctacagcg 


4006 


tgagctatga 


gaaagcgcca 


cgcttcccga 


acjggagaaag gcggacaggt 


atccggtaag 


4066 


cggcagggtc 


ggaacaggag 


agcgcacgag 


ggagcttcca gggggaaacg 


cctggtatct 


4126 


ttatagtcct 


gtcgggtttc 


gccacctctg 


acttgagcgt cgatttttgt 


gatgctcgtc 


4186 


aggggggcgg 


agcctatgga 


aaaacgccag 


caacgcggcc tttttacggt 


tcctggcctt 


4246 


ttgctggcct 


tttgctcaca 


tgttctttcc 


tgcgttatcc cctgattctg 


tggataaccg 


4306 


tattaccgcc 


tttgagtgag 


ctgataccgc 


tcgccgcagc cgaacgaccg 


agcgcagcga 


4366 


gtcagtgagc 


gaggaagcgg 


aagagcgcct 


gatgcggtat tttctcctta 


cgcatctgtg 


4426 
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cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt 


4486 


aagccagtat acactccgct atcgctacgt gactgggtca tggctgcgcc 


ccgacacccg 


4546 


ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc 


ttacagacaa 


4606 


gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc 


accgaaacgc 


4666 


gcgaggcagc aaggagatgg cgcccaacag tcccccggcc acggggcctg 


ccaccatacc 


4726 


cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc 


catcggtgat 


4786 


gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgccgg 


ccacgatgcg 


4846 


tccggcgtag aggatctgct catgtttgac agcttatc 




4884 
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